Concept Relation Discovery and Innovation Enabling Technology (CORDIET)

Authors

  • Jonas Poelmans
  • Paul Elzinga
  • Alexey Neznanov
  • Stijn Viaene
  • Sergei O. Kuznetsov
  • Dmitry I. Ignatov
  • Guido Dedene
Abstract

Concept Relation Discovery and Innovation Enabling Technology (CORDIET) is a toolbox for gaining new knowledge from unstructured text data. At the core of CORDIET is C-K theory, which captures the essential elements of innovation. The tool uses Formal Concept Analysis (FCA), Emergent Self Organizing Maps (ESOM) and Hidden Markov Models (HMM) as its main artifacts in the analysis process. The user can define temporal, text mining and compound attributes. Text mining attributes are used to analyze the unstructured text in documents, while temporal attributes use the documents' timestamps for analysis. Compound attributes are XML rules based on text mining and temporal attributes. The user can cluster objects with object-cluster rules and can split the data into pieces with segmentation rules. The artifacts are optimized for efficient data analysis; object labels in the FCA lattice and ESOM map contain a URL on which the user can click to open the selected document.
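
To make the attribute types in the abstract more concrete, the following is a minimal Python sketch of how text mining, temporal and compound attributes could feed a formal context for FCA. It is not CORDIET's implementation: the `Document` class, the attribute names, the search terms, and the representation of compound attributes as Python sets (the abstract says CORDIET uses XML rules) are all assumptions made for illustration. Concepts are enumerated naively, which is only practical for very small contexts.

```python
from dataclasses import dataclass
from datetime import date
from itertools import combinations


@dataclass(frozen=True)
class Document:
    doc_id: str
    text: str
    timestamp: date


# Hypothetical text mining attributes: attribute name -> search terms.
TEXT_ATTRIBUTES = {
    "violence": {"hit", "kicked"},
    "partner": {"husband", "wife"},
}

# Hypothetical temporal attributes: attribute name -> predicate on the timestamp.
TEMPORAL_ATTRIBUTES = {
    "year_2010": lambda ts: ts.year == 2010,
}

# Hypothetical compound attributes: name -> attributes that must all hold.
# (Plain sets stand in for the XML rules mentioned in the abstract.)
COMPOUND_ATTRIBUTES = {
    "domestic_2010": {"violence", "partner", "year_2010"},
}


def attributes_of(doc):
    """Return the set of attribute names that hold for one document."""
    words = set(doc.text.lower().split())
    found = {name for name, terms in TEXT_ATTRIBUTES.items() if words & terms}
    found |= {name for name, test in TEMPORAL_ATTRIBUTES.items() if test(doc.timestamp)}
    found |= {name for name, parts in COMPOUND_ATTRIBUTES.items() if parts <= found}
    return frozenset(found)


def build_context(docs):
    """Formal context as a mapping: object (document id) -> its attributes."""
    return {doc.doc_id: attributes_of(doc) for doc in docs}


def formal_concepts(context):
    """Naive enumeration of (extent, intent) pairs by closing every object subset."""
    all_attrs = frozenset().union(*context.values())
    objects = sorted(context)
    concepts = set()
    for size in range(len(objects) + 1):
        for subset in combinations(objects, size):
            # Intent: attributes shared by every object in the subset.
            intent = all_attrs
            for obj in subset:
                intent = intent & context[obj]
            # Extent: every object that has all attributes of the intent.
            extent = frozenset(o for o in objects if intent <= context[o])
            concepts.add((extent, intent))
    return concepts


# Made-up example documents.
docs = [
    Document("d1", "he hit his wife", date(2010, 5, 1)),
    Document("d2", "the husband left", date(2011, 1, 1)),
    Document("d3", "someone kicked the door", date(2010, 3, 3)),
]
context = build_context(docs)
concepts = formal_concepts(context)
```

Each resulting pair groups the documents (extent) that share a set of attributes (intent); a lattice viewer would draw these pairs ordered by extent inclusion.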

Similar resources

CDUD'11 – Concept Discovery in Unstructured Data

...ing Concepts from Text Documents by Using an Ontology (Ekaterina Cherniak, Olga Chugunova, Julia Askarova, Susana Nascimento and Boris Mirkin); Extraction and Use of Opinion Words for Three-Way Review Classification Task (Ilia Chetviorkin and Natalia Loukachevitch); Constructing Galois La...


A System for Knowledge Discovery in Big Dynamical Text Collections

Software system Cordiet-FCA is presented, which is designed for knowledge discovery in big dynamic data collections, including texts in natural language. Cordiet-FCA allows one to compose ontology-controlled queries and outputs concept lattice, implication bases, association rules, and other useful concept-based artifacts. Efficient algorithms for data preprocessing, text processing, and visual...


Retrieval of Criminal Trajectories with an FCA-based Approach

In this paper we briefly discuss the possibilities of Formal Concept Analysis for gaining insight into large amounts of unstructured police reports. We present a generic human-centred knowledge discovery approach and showcase promising results obtained during empirical validation. The first case study focusses on distilling indicators for identifying domestic violence from 4814 reports with the a...


Performance comparison of four commercial GE Discovery PET/CT scanners: A Monte Carlo study using GATE

  Combined PET/CT scanners now play a major role in medicine for in vivo imaging in oncology, cardiology, neurology, and psychiatry. As the performance of a scanner depends not only on the scintillating material but also on the scanner design, with regards to the advent of newer scanners, there is a need to optimize acquisition protocols as well as to compare scanner ...


Big Data-driven Technology Innovation: Concept and Key Problems

Against the background of big data, technological innovation has met some new opportunities and challenges. Based on expounding the concept and key technologies of big data, the concept, main data resources and characteristics of data-driven technological innovation are analyzed. Some key problems of data-driven technological innovation are discussed from technological and management perspective...


Journal:
  • CoRR

Volume abs/1202.2895

Publication date 2011